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In the paper we study sharp maximal inequalities for martingales and non-negative submartin- 
gales: if /, g are martingales satisfying 

|ds»|<|d/„|, 71 = 0,1,2,..., 

almost surely, then 

sup|5n| <P||/||p, P>2, 

"n>0 "P 

and the inequality is sharp. Furthermore, if a £ [0, 1], / is a non-negative submartingale and g 
satisfies 

|dpn|<|d/n| and |E(dff„+i|7k)| < aE(d/„+i|J r „), 71 = 0,1,2,..., 
almost surely, then 

||sup|s„||| <(a + l)p||/|| p , P >2, 

"n>0 "p 

and the inequality is sharp. As an application, we establish related estimates for stochastic 
integrals and Ito processes. The inequalities strengthen the earlier classical results of Burkholder 
and Choi. 

Keywords: differential subordination; martingale; maximal function; maximal inequality; 
submartingale 

1. Introduction 

The purpose of the paper is to provide the best constants in some maximal inequalities for 
martingales and non-negative submartingales. Let us start with introducing the necessary 
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notation. Let (f2,.F,P) be a non-atomic probability space, equipped with a filtration 
{J~n)n>o, that is, a non-decreasing family of sub-cr-ficlds of T . Let / = (/„) and g = (g n ) 
be adapted, real- valued integrable processes. The difference sequences df = (d/„) and 
dg = (dg n ) of / and g are defined by the equations 

n n 

fn=/]dfk, g n = ^2dg k , n = 0,1,2,.... 

fc=0 fe=0 

We are particularly interested in those pairs (/, g) for which a certain domination relation 
is satisfied. Following Burkholdcr [6], we say that g is differentially subordinate to / if, 
for any n > 0, we have 

Pfcl < |d/n|) = 1. 

As an example, let g be a transform of / by a predictable sequence v = (v n ) bounded in 
absolute value by 1; that is, we have P(|v„| < 1) = 1 and df n = v n dg n , n>0. Here, by 
predictability, we mean that vq is J-o-measurable and v n is J-n-i-measurablc for n > 1. 
In the particular case when each v n is deterministic and takes values in { — 1,1}, we will 
say that g is a ±1 transform of /. 

Another domination we will consider is the so-called a-strong subordination, where 
a is a fixed non-negative number. This notion was introduced by Burkholdcr in [10] in 
the special case a = 1 and extended to a general case by Choi [12]: The process g is 
a-strongly subordinate to / if it is differentially subordinate to / and, for any n > 0, 

|E(d fln+1 |J-„)|<a|E(d/„+i|J-„)| 

almost surely. 

There is a vast literature concerning the comparison of the sizes of / and g under 
the assumption of one of the dominations above and the further condition that / is a 
martingale or non-negative submartingale; we refer the interested reader to the papers 
[6, 9, 10, 12, 15, 16, 18-21] and the references therein. In addition, these inequalities 
have found their applications in many areas of mathematics: Banach space theory [4, 5]; 
harmonic analysis [8, 13, 14]; functional analysis [6, 7, 20]; analysis [1, 2]; stochastic 
integration [6, 11, 17, 20, 21]; and more. To present our motivation, we state here only 
two theorems. Let us start with a fundamental result of Burkholder [6]. We use the 
notation ||/|| p = sup„ ||/„|| P , p£ [l,oo]. 

Theorem 1.1 (Burkholder). Assume that f , g are martingales and g is differentially 
subordinate to f . Then, for any 1 <p < oo, 

NI p <(p*-i)||/IU (i.i) 

where p* = max{p,p/ (p — 1)}. The constant p* — 1 is the best possible; it is already the 
best possible if g is assumed to be a ±1 transform of f . 
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Here, by the optimality of the constant, we mean that for any r < p* — 1 there exists 
a martingale / and its ±1 transform g, for which ||g||p>^||/||p. 

The submartingale version of the estimate above is the following result of Choi [12]. 

Theorem 1.2 (Choi). Assume that f is a non-negative submartingale and g is a- 
differentially subordinate to f , a £ [0, 1] . Then for any 1 < p < oo, 

\\g\\p<(p* a -mf\\p, (i-2) 

where p* = max{(a + l)p,p/(p — 1)}. The constant is the best possible. 

In the paper we deal with a considerably harder problem and determine the optimal 
constants in the related moment estimates involving the maximal functions of / and g. 
For n > 0, let /* = sup 0<fc<Il \fk\ and /* = sup fc>0 |/fc|. Here is our first main result. 

Theorem 1.3. Let f , g be martingales with g being differentially subordinate to f . Then 
for any p>2, 

ll<?X<Pll/ll P (1-3) 

and the constant p is the best possible. It is already the best possible in the following 
weaker inequality: If f is a martingale and g is its ±1 transform, then 

HflX<P||/lp. (1-4) 

Note that the validity of the estimates (1.3) and (1.4) is an immediate consequence of 
(1.1) and Doob's bound ||/*|| p < ^ry||/||p, p > 1. The non-trivial (and quite surprising) 
part is the optimality of the constant p. 

Now let us state the submartingale version of the theorem above. 

Theorem 1.4. Fix a 6 [0, 1] . Let f be a non-negative submartingale and g be real valued 
and a- strongly subordinate to f . Then for any p>2, 

\\g*\\ P <(a + l)p\\f\\ p (1.5) 

and the constant (a + l)p is the best possible. It is already the best possible in the weaker 
estimate 

llffX<(a+l)p||/X. (1-6) 

There is a natural question: What is the best constant in the inequalities above in the 
case 1 < p < 2? Unfortunately, we have been unable to answer it; our reasoning works 
only for the case p>2. 

The proof of (1.5) is based on a technique invented by Burkholdcr in [11]. It enables us 
to translate the problem of proving a maximal inequality for martingales to that of finding 
a certain special function, an upper solution to a corresponding nonlinear problem. The 



1330 



A. Os§kowski 



method can be easily extended to the submartingale setting (see [17]) and we construct 
the function in Section 3. For the sake of construction, we need a solution to a differential 
equation that is analyzed in Section 2. The next two sections are devoted to the proofs 
of the announced results: Section 4 contains the proof of the estimate (1.5) and the final 
part concerns the optimality of the constants appearing in (1.4) and (1.6). In the final 
section, we present some applications: sharp estimates for stochastic integrals and ltd 
processes. 



2. A differential equation 



For a fixed a £ (0, 1] and p > 2, let C = C v . a = [(a + l)p] p (p — 1). A central role in the 
paper is played by a certain solution to the differential equation 

_ -l + C(l- 7 (x)h(z)sP-' 
71 j ~ 1 + C(1-7(z))zp-i ■ [ ' 

Lemma 2.1. There is a solution 7 : [((a + l)p) _1 , 00) — > K of (2.1 ), satisfying the initial 
condition 

7(^) = !-[(« + (2.2) 
The solution is non- decreasing, concave and bounded from above by 1. 



Proof. Let 7 be a solution to (2.1), satisfying (2.2) and extended to a maximal subin- 
terval / of [((a + l)p) _1 , 00). It is convenient to split the proof into a few steps. 

Step 1: I = [((a + l)p) _1 ,oo). In view of the Picard-Lindelof theorem, this will be 
established if we show that 7 < 1 on /. To this end, suppose that the set {x € I : j(x) = 1} 
is non-empty and let y denote its smallest element. Then, by (2.1), we have j'(y) = — 1, 
which, by minimality of y, implies 7 (((a + l)p) _1 ) > 1 and contradicts (2.2). 

Step 2: Concavity of 7. Suppose that the set {x £ I:^f"{x) > 0} is non-empty and let 
z denote its infimum. Consider the functions F, G : (((a + l)p) _1 , 00) — > K given by 

F(x) = 7(2;) — xj'(x), 
G(x)^{l~ 1 (x))x p - 2 . 

Observe that 

G>0 on I and F>0 on (((a + l)p) _1 , z + e) (2.3) 
for some e > 0. The statement about G is clear, while the positivity of F follows from 
F'(x) = - X1 "{x) > 0, xe (((a + l)p)~\ z] 
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and 

F(((a+l)p)~ 1 +) = - >0. 

P 

Now multiply (2.1) throughout by 1 + (7(1 — 7(x))a; p ~ 1 and differentiate both sides. We 
obtain an equality that is equivalent to 

1 "(x)(l + CxG(x)) = CF(x)G'(x), x> ±- . (2.4) 

(q + l)p 

As a first consequence, we have z > ((a + l)p) _1 . To see this, tend with x down to 
((a + l)^)^ 1 and observe that F and G have strictly positive limits; furthermore, 

G'(x) = x p - 3 [{p - 2)(1 - 7(3:)) - ar/(x)] =: a; p ~ 3 J(x) (2.5) 

with J(((a + 1» _1 ) = < 0. Combining (2.3) and (2.4) we see that, for some 

e > 0, G' < on (z — e, z) and G 1 > on (z, z + e). Consequently, by (2.5), J < on 
(z — e, z) and J > on (z, z + e). This implies J'(z) > and since J'(z) = — (p — l)7'(z), 
we get Y(z) < 0. However, this contradicts G'(z) — 0, in view of (2.5) and j(z) < 1. Let 
us stress that here, in the last passage, we use the inequality p > 2. 

Step 3: 7 is non- decreasing. It follows from (2.4), the concavity of 7 and positivity of 
F and G, that G' < 0, or, by (2.5), 

(p-2)(l-7(a;))-s7'(a;)<0. (2.6) 

The claim follows. □ 



Let us extend 7 to the whole half-line [0, 00) by 

7(ar) = [(p-l)(o! + l)-l]a;+-, x 6 

P 



0, 1 



(a + l)p 



It can be verified readily that 7 is of class C 1 on (0, 00). For the sake of reader's conve- 
nience, the graph of 7, corresponding to p = 3 and a = 1, is presented on Figure 1. 

Let H : [((a + l)^)" 1 , 00) — )• [1, 00) be given by H (x) = x + j(x) and let h be the inverse 
to H. Clearly, we have 

x-l<h(x)<x, x>l. (2.7) 
We conclude this section by providing a formula for hi to be used later. As 

h'(x) = * . ,, = \ . ,, , x>l, (2.8) 

it can be derived that, in view of (2.1), 

1 + ((q + 1) P ) P (P - 1H%) - g + l)^^- 1 r , 

W ((a + l)p)P(p-l)(/ l (a;)-a; + l)/i(.T)P- 2 a; ' 1 J 
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Figure 1. The graph of 7 (the bold line) in the case p = 3, a = 1. Note that 7 is linear on 
[0,1/6] and solves (2.1) on (1/6, 00). 

3. The special function 

Throughout this section, a £ (0, 1] and p > 2 are fixed. Let 5 denote the strip [0, 00) x 
[—1,1]. Consider the following subsets of S. 

D = {(x,y)GS:\y\<^(x)}, 

D 1 = {(x,y) eS:\y\> j(x),x+\y\ < 1}, 

D 2 = {(x, y)£S:\y\> j{x),x + \y\ > 1}. 

Introduce the function u : S — » K by 

n-[(a + l)p]V on L>o, 

«(s,y)=| l _( P x+ P\y\~ i y \p(p( a + l)-l)x-p\y\ + l} on D\, 

{ 1 - [(a + l)p]*h(x + \y\) p - x \px - (p - l)h(x + \y\)} on D 2 . 
Let [/ : [0, 00) x R x (0, 00) -> E be given by 

C/(x,y,z) = (|y| Vz) p uf —E—.-JI—Y 

As we will see below, the function U is the key to the inequality (1.5). Let us study 
the properties of this function. 

Lemma 3.1. The function U is of class C 1 . Furthermore, there exists an absolute con- 
stant K such that, for all x > 0,y G K, z > 0, we have 

U(x,y,z)<K(x + \y\+z) p (3.1) 
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and 

U x (x,y,z)<K(x + \y\+z)P-\ U x (x,y, z) < K(x + \y\ + z)^ 1 . (3.2) 

Proof. The continuity of the partial derivatives can be verified readily. The inequality 
(3.1) is evident for those (x, y, z), for which ( n£jj , \ y fv~ ) G U D\\ for the remaining 
(x,y,z), it suffices to use (2.7). Finally, the inequality (3.2) is clear if ( \ y \ Vz , \ v \\j z ) € 
Dq U D\. For the remaining points one applies (2.7) and (2.8), the latter inequality 
implying h' < 1. □ 

Now let us deal with the following majorization property. 

Lemma 3.2. For any (x, y, z) <G [0, oo) x 1 x (0, oo), we have 

U(x, y, z) > (\y\ V zf - [(a + l)p] p x p . (3.3) 

Proof. The inequality is equivalent to u(x, y) > 1— [(a + l)p] p x p and we need to establish 
it only on D\ and D^. On D\, the substitutions X — px and Y =p\y\ — 1 (note that Y > 0) 
transform it into 



(a + l) p X p > {(p(a + 1) - l)X - Y]. 

This inequality is valid for all non- negative X , Y. To see this, observe that by homogeneity 
we may assume X + Y = 1, and then the estimate reads 

F(X) := (a + l) p X p - (p - l)- p+1 [p(a + 1)X - 1] > 0, X E [0, 1]. 

Now it suffices to note that F is convex on [0, 1] and satisfies 

F'( -) 



(p-l)(a+l)J V(P -!)(« + !) 

It remains to show the majorization on Di. It is dealt with in a similar manner: Setting 
s = x + \y\ > 1, we see that (3.3) is equivalent to 

G{x) := x p - hisf- 1 [px - (p - l)h(s)} > 0, s - 1< x < h{s). 

It is easily verified that G is convex and satisfies G(h(s)) = G'(h(s)) = 0. This completes 
the proof of (3.3). □ 



The main property of the function U is the concavity along the lines of slope belonging 
to [-1,1]. 
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Lemma 3.3. For fixed y,Z satisfying z > 0, \y\ < z, and any a G [— 1, 1], the function 
* = ®y,z,a ■ [0, oo ) -> K given fc?/ 

= U(t,y + at,z) 

is concave. 

Before we turn to the proof, let us first establish some useful consequences. 

Corollary 3.4. (i) The function U has the following property: For any x, y, z, k x , k y such 
that x,x + k x > 0, z > 0, \y\ < z and \k y \ < \k x \, we have 

U(x + k x ,y + k y ,z) < U(x,y,z) + U x (x,y,z)k x + U y (x,y,z)k y (3.4) 

(forx = 0, we replace U x (0,y,z) by right-sided derivative U x (0+, y, z)). 
(ii) For any x > 1, we have 

£/(x,l,l) <0. (3.5) 

Proof, (i) This follows immediately. 

(ii) We have $ o ,i,x-i(0) = f7(0,0, 1) = 1 and <J> ,i. x -i (((a + = U(((a + 

a; -1 ((a + l)*)) -1 ,!) = 0, since (((a + tfp)' 1 , x~^ ({a + l)p)-\l) G D . Since x > 1 > 
((a + l)p) _1 , the lemma above gives U(x, 1, 1) = 3>o l x- 1 ( x ) < 0. □ 

Proof of Lemma 3.3. By homogeneity, we may assume z = 1. As $ is of class C 1 , 
it suffices to verify that $"(£) < for those i, for which (t,y + at) lies in the interior 
of D( h D\, D2 or outside the strip S. Since U(x,y,z) = U(x, —y, z), we may restrict 
ourselves to the case y + at > 0. If (t, y + at) belongs to Dg, the interior of Do, then 
= -[("+ !)p] p -P(P~ l)* p " 2 < 0, while for (t,y + at) G we have 

ffi /, rrt P 3 (pt+p(?/ + ^)-l)P- 3 (l + a) ^ 

* w (p-i)p- 2 ( 1 2) ' 

where 

/ x = P t[(p - 2)(1 + o)(p(a + 1) - 1) + 2(p(a + 1) - 1 - a)] > 0, 
^2 = (p(j/ + at) - l)(2a + 1 - a) > 0. 
The remaining two cases are a bit more complicated. If (t, y + at) G then 

Cp(l + a) z 

where 

J 1 = h{t + y + at) p ~ 2 h"(t + y + at)[h(t + y + at)- t], 
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■h = h(t + y + at) p - 3 [h'(t + y + at)] 2 [(p-l)h(t + y + at) - (p-2)i\, 
2 



■h 



o+l 



h(t + y + at) p ~ 2 h'(t + y + at). 



Now if we change y and t, keeping s = t + y + at fixed, then J\ + J 2 + J3 is a linear 
function of t G [s — 1, fr(s)]. Therefore, to prove it is non-positive, it suffices to verify this 
for £ = /i(s) and t — s—1. For f = /i(s), we have 



Ji + J2 + J3 = h(s)P~ 2 ti(s) 



h'(s) 



a + 1 



<0, 



since < h'(s) < 1 (see (2.8)). If t = s — 1, rewrite (2.9) in the form 

Cs(h(s) + 1 - s)h(s) p - 2 h'(s) = 1 + C(h{s) + 1 - s)/i(s) p - 1 
and differentiate both sides; result, we obtain 

2 



Cs 



Ji+J2 + h + h{sf- 2 h'{s) 



a + 1 



1 



Ch(s) p ~ 2 [{h'{s) - l)h(s) + (p - 2)(/i(s) + 1 - s)/i'(a)]. 



As h! > and 2/ (a + 1) > 1, we will be done if we show the right-hand side is non-positive. 
This is equivalent to 

h'{s)[h(s) +(p- 2)(h(s) + 1 - s)] < h(s). 

Now use (2.8) and substitute h(s) = r, noting that h(s) + 1 — s = 1 — 7(7"), to obtain 

r+(p-2)(l-7(r))<r(l+Y(r)), 

or r/(r) > (p — 2)(1 — 7(f)), which is (2.6). 

Finally, suppose that y + at > 1. For such t we have Q(t) = (y + at) p u(t/ (y + at), 1), 
hence, setting X = t/(y + 1), Y = y + at, we easily check that $"(t) equals 

Y p - 2 [p{p - l)a 2 u(X, 1) + 2a(p - 1)(1 - aX)u x {X, 1) + (1 - aX) 2 u xx {X, 1)]. 

First let us derive the expressions for the partial derivatives. Using (2.9), we have 



u x (X, 1) = + + 1) - X>(X + If" 1 ] - + 

A+1 p — 1 

1) = ifcJi [1 + c(MA + 1) - x>(* + ir 1 ] 

Cp/i(X + l)P- 1 Cp/i(X + l)P- 2 ^'(A + l) 



p-1 



X + l 



X + l 
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Now it can be checked that 

&'(t)Y 2 -P/p = K 1 + K 2 + K 3 , 



where 



A'i = (p-1) 



)(yTi) \ l + c ( h (x + i)-x)h{x + if-% 

chjx + iy- 1 2 
k 2 = Y+i — ~ 

l-al\ 2 l + C(/i(X + l) + 



K: 



< 



X + l J h(X + l)-X 



We may write 

~(x + iy 



K 2 + K 3 < - Ch (* + })* [(1 + 2a - a 2 X)(X + 1) + (1 - aX) 2 ] 



(x + iy 



where, in the last passage, we used a < 1. On the other hand, as h is non-decreasing, we 
have 

_ Ch(l)P < Ch(X + l)P- 1 h(l) 
p — 1 ~~ p — 1 

Moreover, since x i— > h(x + 1) — x is non-increasing (see (2.8)), we have /ipT + 1) — X < 
h(l). Combining these two facts, we obtain 

K, < (p - 1) ( yty) [i + ^UW* + ir 1 ] 

<(^)W + ir\ 

as ph(l) = (a + 1) _1 < 1. This implies K\ + K 2 + K 3 < and completes the proof. □ 



The hnal property we will need is the following. 
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Lemma 3.5. For any x, y, z such that x >0, z > and \y\ < z, we have 

U x (x,y,z)<-a\U y (x,y,z)\ (3.6) 
(ifx = 0, then U x is replaced by a right-sided derivative). 

Proof. It suffices to show that for fixed y,z, \y\ < z, and a £ [—a, a], the function 
$ = &y, z ,a'- [0, oo) — > R given by = U(t,y + at,z) is non-increasing. Since a < 1, 
we know from the previous lemma that $ is concave. Hence all we need is $'(0+) < 0. 
By symmetry, we may assume y > 0. If y < 1/p, then the derivative equals 0; in the 
remaining case, we have 



4. The proof of (1.5) 

First let us observe that it suffices to show (1.5) for strictly positive a. This is an immedi- 
ate consequence of the fact that a-strong subordination implies a'-strong subordination 
for a < a' . 

Suppose /, g are as in Theorem 1.4. We may restrict ourselves to the case < oo. 
Hence, by Choi's inequality (1.2), we have \\g\\ p < oo. It suffices to show that for any 
n = 0, 1, 2, . . . we have 

(a + 1)W£]<0. 

Clearly, we may assume that P(#o > 0) = 1, simply replacing /, g by / + e, g + e if 
necessary (here e is a small positive number). In particular, this implies /q > almost 
surely. In view of the majorization (3.3), we will be done if we show that the expectation 
¥,U(f n ,g n ,g*) is non-positive for any n. As a matter of fact, we will show more; namely, 
that the process (U(f n ,g n , <?^)n>o) is a supcrmartingalc and E[/(/ , 5o 5 9o) < 0. 

To this end, fix n > 1 and observe that <j* < |<7o| + |<?i| + • • • + \g n \, so g* be- 
longs to LP. Thus, by Lemma 3.1 and Holder's inequality, the variables U(f n ,g ni g*), 
U x (fn-l,9n-i,9n-i)^fn and U y (f n -i,9n-i, d 5« are integrable. Moreover, by defi- 
nition of U and the inequality (3.4), 

)l^n-x) 

= E(C/(/ n _ 1 + d/ n ,g n _i + dg„,3*_ 1 )|J"„_i) 

< E[J7 (/n-X, ffn-X, fl£_ x) + U x (fn-1, 9n-l, 9n-l) d/n 

+ (/n - x , 5n- x , g„ - 1 ) dg n I J"„_ i] 

< ^(/n-X,5n-X,5^_i)- 
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The latter inequality is the consequence of the following. By (3.6) and the submartingale 
property of /, 

E(t r x(/«-i,.9n-i,3^-i)d/„|J r „-i) = [/ 2; (/„_i,.g n _i,g*_ 1 )E(d/ n |J r „_i) 

< -a|E/j / (/ n _i,flr n _i,3*_ x )|E(d/ n |J'n-i) 

< -U y (f n - 1 ,g n - 1 ,g^ 1 )E(dg n \J 7 n - 1 ) 
= -E( U y (/„_ i , i , g* _ ! ) dg n \ J"„_ 1 ) , 

where the second inequality is due to a-domination. 

To complete the proof, it suffices to show that E[/(/ , g , g^) < 0. However, t/(/o, go, go) = 
U{/ot 9o, go) = 9oU( fo/go, 1, 1) almost surely and the estimate follows from Corollary 
3.4(ii). 

5. Sharpness 

We start with inequality (1.4) and restrict ourselves to the case when g is a ±1 transform 
of /. Suppose the best constant in this estimate equals /3 > 0. This implies the existence 
of a function W:RxRx [0, oo) x [0, oo) — > R, which satisfies the following properties: 

^(1,1,1,1) <0, (5.1) 
W(x,y,z,w) = W(x,y, \x\V z,\y\V w), if x, y G R, w, z > 0, (5.2) 
(|y| Vu>) p -/3 p (|x| Vz) p < W{x,y,z,w), if G R,w,z > (5.3) 

and, furthermore, 

aF+(:r + ii, y + eti, z,w) + (1 — a)W(x + t 2 , y + et 2 , z,w) < W(x,y, z, w) 

(5.4) 

for any \x\ < z, \y\ < w, e G { — 1, 1}, a G (0, 1) and ii, t2 with aii + (1 — a)t 2 = 0. 
Indeed, one puts 

W(x, y, z, w) = sup{E(. 9 ; V wf - 0*E(/* V z) p }, (5.5) 

where the supremum is taken over all integers n and all martingales /, g satisfying 
P((/o,3o) = { X )V)) = 1 an d d/fe = id^fc, fc = 1,2, . . . (sec [11] for details). This formula 
allows us to assume that W is homogeneous: W(tx,ty,tz,tw) = tW(x,y,z,w) for all 
a;, y G R, z, u; > and t > 0. 

Now the idea is to exploit the above properties of W to get (3>p. To this end, let S be 
a small number belonging to (0, 1/p). By (5.4) applied to x = 0, y = w = l, z = 6/(1 + 25), 
e = 1 and t\ = 5, t 2 = — l/p, we obtain 

i / * (5 - 6) 

l+p<5 V 1 + 25 
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1 . 



W\ — , 1 , — 

p' p'i + 2<y' 
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(5.7) 



Furthermore, by (5.2), 



W 5,1 + 5, 



5 



1 + 25' 



1 + 5 = W(5,l + 5,5,l + 5), 



which, by (5.4) (with x = z = 5, y = w = 1 + 5, e = — 1 and t\ = —5, t 2 = | + 5(i — 1)), 
can be bounded from below by 



pS 



w 



1 + 5 
P 



,1 



1 



1 + 5 

Using (5.3), we get 

'1 + 5 



+ 6{ 2 



,5,1 + 5 



1 + 5 - p5 
1 + 5 



W(0, 1 + 25,5, 1 + 5). 



5 2 



,5,1 + 5 >(l + 5) p 



P 



Furthermore, by (5.2) and the homogeneity of W, 

W(0, 1 + 25,5,1 + 5) = W(0, 1 + 25, 5, 1 + 25) = (1 + 25fW (0, 1, — ^— , 

V 1 + 25 

Now plug all the above estimates into (5.6) to get 



W 0,1, 



1 + 25 



,1 



(l + 5-p5)(l + 25) p 



(l+5)(l+p5) 



> 



p5 



P 



(5.8) 



1 +p5 

Now it follows from the definition (5.5) of W that 

wU,i, 



(i + (i + sy- 1 ) 



,1 <W(0, 1,0,1). 



1 + 25' 

Furthermore, one easily checks that the function 

(l + s-ps){l + 2s)P 



F(s) = 1 - 



(l + s){l+ps) ' 



s> , 

p 



satisfies F(0) = F'(0) = 0. Hence 
1 



p\ p W(0,l,0,l)-F(5)-(l+p5) 



p5(l + (l + 5)p- 1 ) 
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and letting 5 -> yields 1 - (|)*> < 0, or /3 > p. 

The reasoning for the inequality (1.6) is essentially the same: suppose the best constant 
in the estimate equals 7 > 0. Introduce the function V : [0, 00) x R x [0, 00) x [0, 00) — > M 

by 

V(x, y, z, w) = sup{E(. g ; V wf - 7 ?E(/ r : V zf}, 

where the supremum is taken over all integers n, all non- negative submartingales / and 
all integrable sequences g satisfying P((/o, go) = (x, y)) = 1 and, for fe = 1, 2, . . . , 

|d/fe| > \dg k \, aE(df k \T k -i) > \E(dg k \ T k ^)\ 

with probability 1. We see that V is homogeneous and satisfies the properties analogous 
to (5.1)-(5.4) (with obvious changes: in (5.2) and (5.3) one must assume x > 0; in (5.3) 
the number /? is replaced by 7; and, in (5.4), we impose x,x + t\,x + t2 > 0). In addition, 
there is an extra property of V, which corresponds to the fact that we deal with the 
inequality for submartingales: 



V(x + d, y + ad, z,w) < V{x, y,z,w), if x > 0, y € K, w, z > 0, d > 0. 



(5.9) 



Now fix S G (0, l/p) and apply this property with x = 0, y = w = 1, z = 6/(1 + (a+ l)p), 
d = 8 and then use (5.2) to obtain 



V 0,1, 



l + (a + l)<T 



1 >V [S,l + aS, 



l + {a + l)5 : 
= V(S,l + aS, 5,1 + a5). 

Using (5.2), (5.3) and (5.4) as above, we have 

1_ ((^Ti>) . 



(5.10) 



V(8, 1 + a8, S, 1 + aS) > S ^ + 1 } P (1 + aSf 

1 + ao 



+ (a + l)6 



• 1 



which, combined with (5.10), gives 

5 



V 0,1, 



1 



l + (a + l)S 
>6(a + l)p{l + a5) p - 1 

Now it suffices to use 

V\ 



^ 1 + ^-^ + 1)^ 
1 + ao 



7 



{a + l)p 



l + (a + l)<5' 



1 <V(0, 1,0,1) 
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and the fact that the function 



„, n 1 + cts — s(a + l)v , . 

G(s) = l i ^1+a + lsf, s>-l/a, 

1 + as 



satisfies G(0) = G"(0) = 0, to obtain 



7 \ P < V(0,1,0,1)G(6) 



(a + l)pj ~ 5(a + l)p(l + a5)P- 1 ' 
Letting 5 — >• gives 1 — ( ( a +i) p ) P < 0, or 7 > (a + l)p. This completes the proof. 

6. Inequalities for stochastic integrals and Ito 
processes 

In this section we present applications of the results above. Theorem 1.4 in the special 
case a = 1 yields an interesting inequality for the stochastic integrals. Suppose (f2, T,F) is 
a complete probability space, filtered by a non-decreasing right-continuous family (J 7 t)t>o 
of sub-cr-fields of T . In addition, let T§ contain all the events of probability 0. Suppose 
X = (X t )t>a is an adapted non-negative right-continuous submartingale with left limits 
and let Y be the Ito integral of H with respect to X, 



Y t =H X a + / H s dX 8 , t>0. 
J(o,t] 

Here H is a predictable process with values in [—1, 1]. Denote \\X\\ p = sup t>0 ||^t|| P and 
X* — sup t >Q \Xt\. We will establish the following extension of Theorem 1.4. 

Theorem 6.1. Under the above conditions, we have, for any p>2, 

\\Y*\\ p <2p\\X\\ p , (6.1) 

and the constant 2p is the best possible. It is already the best possible in the weaker 
estimate 

\\Y*\\ p <2p\\X*\\ p . 

Proof. The constant 2p is optimal even in the discrete-time setting, so all we need is 
to show (6.1). This is a consequence of the approximation results of Bichteler [3]. We 
proceed as follows: Consider the family Y of all processes Y of the form 

n 

Y t = H X + h k [X rkM - X Tk _ lM ], (6.2) 
fc=i 



1342 



A. Os§kowski 



where n is a positive integer, hk belongs to [—1, 1] and the stopping times take only 
a hnitc number of finite values, with = tq < t% < ■ ■ ■ < r„. Let 

/ = {X To , X Tl , . . . , X Tn , X Tn , . . .) 

and let g be the transform of / by (Hq, hi, h%, . . . , h n ,0, 0, . . .). In virtue of Doob's optional 
sampling theorem, / is a submartingale. Therefore, by Theorem 1.4, if r„ < t almost 
surely, then for Y as in (6.2), 

\\Y*\\ p =\\g*\\ p <2p\\f n \\ p <2p\\X t \\ p . 

Now we have that X and H satisfy the conditions of Proposition 4.1 of Bichtclcr [3]. 
Thus by (2) of that proposition, if Y is as in the statement of the theorem above, then 
there is a sequence (Y J ') of elements of Y such that lim^o^Y^ — Y)* = almost surely. 
Hence, by Fatou's lemma, 

||Y t *|| p <2p|LY t || p . 

Now take t — > oo to complete the proof. □ 

The result above can be further strengthened. Assume that X is a non-negative sub- 
martingale and X = Xq + M + A stands for its Doob-Meyer decomposition, uniquely 
determined by the condition that A is predictable. Let a G [0, 1] be fixed and suppose 
0, ip are predictable processes satisfying \<f> s \ < 1 and \ijj s \ <a for all s. Consider the ltd 
process Y such that |Yo| < Xq and 

Y t =Y + / (t>sdM 3 + f ip s dA a 

for all t > 0. We have the following sharp bound. 

Theorem 6.2. For X , Y as above, we have 

\\Y*\\ p <(a + l)p\\X\\ p 

and the inequality is sharp. So is the weaker estimate 

\\Y*\\ p <(a + l)p\\X*\\ p . 

This result can be established using essentially the same approximation arguments as 
above; we omit the details. We would only like to mention here that there is an alternative 
way of proving Theorems 6.1 and 6.2, based on Ito's formula applied to the function u 
(as the function is not of class C 2 , one needs some additional "smoothing" arguments to 
overcome this difficulty). Sec [19] or [20] for similar reasoning. 
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